Automated Transformation of Semi-Structured Text Elements

نویسندگان

  • Johannes Heurix
  • Antonio Rella
  • Stefan Fenz
  • Thomas Neubauer
چکیده

Interconnected systems, such as electronic health records (EHR), considerably improved the handling and processing of health information while keeping the costs at a controlled level. Since the EHR virtually stores all data in digitized form, personal medical documents are easily and swiftly available when needed. However, multiple formats and differences in the health documents managed by various health care providers severely reduce the efficiency of the data sharing process. This paper presents a rule-based transformation system that converts semi-structured (annotated) text into standardized formats, such as HL7 CDA. It identifies relevant information in the input document by analyzing its structure as well as its content and inserts the required elements into corresponding reusable CDA templates, where the templates are selected according to the CDA document type-specific requirements.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presenting Semi-Structured Text Retrieval Results

DEFINITION Presenting semi-structured text retrieval results refers to the fact that, in semi-structured text retrieval, results are not independent and a judgment on their relevance needs to take their presentation into account. For example, HTML/XML/SGML documents contain a range of nested sub-trees that are fully contained in their ancestor elements. As a result, semi-structured text retriev...

متن کامل

Leveraging Human Intelligence: Semi-automated Processing in Assuring Access to Digital Content

Need for standardization in the content production industry have led producers of popular authoring and publishing applications to adopt structured mark-up languages, such as XML, to implement their content file formats. As part of our effort to ensure long term access to such content, we need to consider properties of the mark-up schemas and devise methods to enable effective mapping among the...

متن کامل

Web Entity Detection for Semi-structured Text Data Records with Unlabeled Data

We propose a framework for named entity detection from Web content associated with semi-structured text data records, by exploiting the inherent structure via a transformation process facilitating collective detection. To learn the sequential classification model, our framework does not require training labels on the data records. Instead, we make use of existing named entity repositories such ...

متن کامل

Structured Text Modification Using Guided Inference

We describe a technique that allows end-users to specify automated transformations of structured text by inferring an underlying model. Inference is achieved with a novel algorithm, Structured Prediction by Partial Match (SPPM), a generalisation of the well-known PPM approach to predictive text entry and compression. We created two simple applications, as examples of "first steps" end-user prog...

متن کامل

Entity recognition and resolution in semi-structured data

Potentially usable business information exists in unstructured form. This information, although machine readable, resides in unstructured human language texts that are difficult to process by computers. Within this information are references to real world entities, which are the focus of this paper. More specifically, we address the recognition of references to entities and their resolution, in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012